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Abstract. Some of the important inequalities associated with quan- 
tum entropy are immediate algebraic consequences of the Hansen-Peder- 
sen- Jensen inequality. A general argument is given in terms of the ma- 
trix perspective of an operator convex function. A matrix analogue of 
MarechaPs extended perspectives provides additional inequalities, in- 
cluding a p + q < 1 result of Lieb. 



1. Introduction 

Several elegant proofs of inequalities due to Lieb [7] and to Lieb and 
Ruskai [8], have recently appeared (see Nielsen and Petz [12], Ruskai [13]). 
We prove that one can use the "fully quantized" Jensen inequality of Frank 
Hansen and Gert Pedersen [5] to eliminate all vestiges of analysis from their 
bivariable arguments. We then show that a matrix version of Marechal's 
extended perspectives can be used to formulate more elaborate joint matrix 
inequalities. In the concluding section we suggest some natural links between 
matrix convexity theory and the foundations of quantum information theory. 

Since the basic difficulties are already apparent in finite dimensions, we 
have restricted our attention to finite matrices, and we have avoided any 
attempt at full generality even in that context. 

I am very much indebted to Mary Beth Ruskai, who corrected a number 
of errors in my first manuscript, and who made me aware of Lieb's result in 
the third section. 

2. The classical and matrix notions of perspectives 

Given a convex function / defined on a convex set K C R n , the perspective 
g is defined on the subset 

L = {(x, t) : t > and x/t G K} 

by 

g(x,t) = f(x/t)t 
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(see [6]). It is a simple exercise to verify that g(x,t) is a jointly convex 
function in the sense that if < c < 1, then 

g{cxi + (1 - c)x 2 ,ct 1 + (1 - c)t 2 ) < cg(x 1 ,t 1 ) + (1 - c)g(x 2 ,t 2 ). 

An elementary but important example is provided by the continuous convex 
function f(x) = xlogx, with /(0) = defined on [0, oo) C R. It follows that 
the perspective function 

g(x, t) = t— log - = x log x — x log t 

is jointly convex. Letting p = (pi) and q = {qi) be finite probability measures 
with pi > and qi > 0, the convexity of / implies that the classical entropy 

h (p) = - ^PilogPi 

is concave, and the convexity of g implies that the relative entropy 

(q,p) h-> H(q\\p) = ^pilogpi -pilogqi 

is jointly convex on pairs of probability measures. 

We recall that if / : [a, b] — ► M is continuous, and T is an n x n self-adjoint 
matrix with spectrum in [a, b], then we can define f n (T) by spectral theory 
(or by using a basis in which T is diagonal). / is said to be matrix convex if 
for each n € N, the corresponding function f n is convex on the self-adjoint 
n x n matrices with spectrum in [a, b]. We usually omit the subscript n. 

Theorem 2.1 (Hansen and Pedersen [5]). If f is matrix convex, and A and 
B are m x n matrices with A* A + B*B = I n , then 

(2.1) f n (A*TA + B*TB) < A* f m (T)A + B*f m (T)B. 

We note that their proof does not entail any analysis, but rather is based 
on a shrewd sequence of matrix manipulations. As pointed out by Winkler 
|14j . the result may be restated that a real function / on an interval in 1R 
is a matrix convex function if and only if the supergraphs of the f n form a 
matrix convex system. 

We begin with some matrix conventions. Given matrices L and R, we let 
[L,R] = LR- RL. Let us suppose that L > and R > 0. If [L,R] = 0, 
i.e., the matrices commute, then we may find a basis in which both matrices 
are diagonalized. It follows that LR > 0, = 0, and we may unam- 

biguously write for the quotient. We also recall that for any continuous 
function /, f(L) commutes with any operator commuting with L (including 
L itself). Using simultaneously diagonalized matrices, it is evident that we 
have relations such as log Li? -1 = logL — logi?. 

Theorem 2.2. Suppose that f is operator convex. When restricted to 
positve commuting matrices, the "perspective function" 

(2.2) (L,R)^g(L,R) = f(^)R 
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is jointly convex in the sense that if [Lj, Rj] = (j = 1,2), L = cL\ + (1 — 
c)L2, R = cR\ + (1 — c)i?2, and < c < 1, then 

(2.3) g(L,R) <cg(L 1 ,R 1 ) + (l-c)g(L 2 ,R 2 ). 

Proof. The matrices A = {cR^R' 1 / 2 and B = ((l-c)R 2 ) 1/2 R~ 1/2 satisfy 
A* A + B*B = I. From Theorem 2.1, 



g(L,R) 



= RV 2 f{R-V 2 LR- x l*)R x l 2 

- Ri/2 f{ A ik) A+B i^) B ) Ri/2 

= (^l) 1/2 / (^) (Ci?!) 1 / 2 + ((1 - C)^/ ( 

= C5 (L 1 ,i? 1 ) + (l-c) 5 (L 2 ,i? 2 ). 



1/2 



§)(d-'-)^) J/2 



□ 



The following is due to Lieb and Ruskai [8] (a related early discussion 
may be found in Lindblad [U]). 

Corollary 2.3. The relative entropy function 

(p,<j) i— > S(p\\a) = Trace p log p — plogcr 
is jointly convex on the strictly positive n x n density matrices p,cr. 

Proof. We let M n have the usual Hilbert space structure determined by 
(X,Y) = Trace XY* . Given positive density matrices a and p, we define 
operators R and L on M n by L{X) = aX and R(X) = Xp. Then we have 
that L and R are commuting positive operators on the Hilbert space M n . 
On the other hand the function f(x) = xlogx is operator convex (see [I], 
p. 123), and thus 

S(p\\a) = (L^log|(/),J) = {g(L,R)(I),I) 
is jointly convex. □ 

The following is due to Lieb 0. It was subsequently used by Lieb and 
Ruskai to prove strong subadditivity for relative entropy [8|. 

Corollary 2.4. J/0 < s < 1, then the function 

F(A, B) = Trace A S K*B 1 ~ S K 
is jointly concave on the strictly positive n x n matrices A, B. 
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Proof. Since f{t) = —t s is operator convex (see [I] Th.5.1.9), g(L,R) = 
—L s R l ~ s is jointly convex for appropriately commuting operators. Again 
using the Hilbert space structure on M n , we let L(X) = AX and R(X) = 
XB. It follows that 

(A,B) i-» —Trace A S K*B 1 ~ S K = {g(L, R)(K*), K*) 
is jointly convex. □ 

Various generalized entropies may be handled in much the same manner. 

3. Marechal's perspectives 

P. Marechal has recently introduced an interesting generalization of per- 
spectivity for convex functions [10] . This also has a natural matrix 

version. For this purpose we use Hansen and Pedersen's earlier result [3J. 

Theorem 3.1. If f is matrix convex, and /(0) < 0, and that A and B are 
m x n matrices with A* A + B*B < I n , then 

fn(A*TA + B*TB) < A*f m (T)A + B* f m (T)B. 

Given continuous functions / and h, and commuting positive matrices L 
and R, we define 

Theorem 3.2. Suppose that f is matrix convex with /(0) < and that h 
is matrix concave with h > 0. Then (L,R) i— > (fAh)(L,R) is jointly convex 
on postive commuting matrices L,R in the sense of (2.3). 

Proof. Let us suppose that L = cL\ + (1 — c)L2 and R = cR\ + (1 — c)i?2 
where [Lj,Rj] = 0. We have that ch(Ri) + (1 - c)h(R 2 ) < h(R), hence 

A = c 1 /2/ l (^ 1 )i/2 /l ( i? )-i/2 

B = (1 - c) 1 / 2 h(R 2 ) 1 / 2 h(Ry 1 / 2 

satisfy 

A* A + B*B 

= /i(i?)- 1/2 c/i(i?i)/i(i?) 1/2 + hiRy^il - c)h(R 2 )h(R)- 1 / 2 
< h(R)- 1/2 h(R)h(R)- 1/2 I = I. 
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/2 



It follows from Theorem 3.1 that 
(fAh)(L,R) 
= hiRf^fi^Ry^LhiRY^hiR) 1 ' 2 

- h ^{ A im) A+B ixk) B ) m) '' 2 
5 h ^ 1/2A "f {m) Ah ^ 112 + h ^ 1/2B "f {m) Bh ^' 

= ch(R ^ {m) h(Rl)i ' 2 + (1 " M ' 2 > {m) h{R " )lt2 

= c(fAh){L x ,R x ) + (1 - c)(fAh)(L 2 ,R 2 ). 

□ 

To illustrate this construction, we reprove a result of Lieb [7]. 
Corollary 3.3. Suppose that < p, q and that p + q < 1. Then the function 

(A, B) i-> Trace A q X*B p X 
is jointly concave on the positive n x n matrices. 

Proof. Since p + q<l, p + q is & convex combination of q and 1, i.e., we 
may choose < t < 1 with p + q = (1 — t)q + tl. If we let q = s, then 

p = -tq + t = (1 - q)t = (1 - s)i. 

Thus it suffices to show that if < s, t < 1, then 

(A 5) i — ^ -Trace A s X*B^- s)t X 

is jointly convex. The functions f(x) = —x s and h(y) = are operator 
convex and concave, respectively, and 

(fAh)(L,R) = h(R)f (JL^j = -R^ = -L'R^. 

If we let L(X) = AX and R(X) = XB for X G M n , then it follows from 
the above theorem that 

(A, B) ^ -Trace A s X*B {1 ~ s)t X = ((fAh)(L,R)(X*),X*) 

is jointly convex. □ 

4. MATRIX CONVEXITY 

Perhaps the most interesting aspect of MarechaFs construction is that 
it behaves well under the Fenchel-Legendre transform, and under iteration. 
S0ren Winkler formulated an analogue of the Fenchel-Legendre duality for 
matrix convex functions [14], but the transforms are generally set- valued 
mappings. Further progress might result if one could reformulate his theory 
in terms of commuting pairs. It should also be noted that other constructions 
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in classical convexity theory, such as the linear fractional transformations of 
convex functions (see [2]) might also have matrix generalizations. 

Until recently the theory of matrix convexity has suffered from a lack of 
examples and applications. With the advent of quantum information theory 
(QIT), this situation has dramatically changed. QIT provides a wealth of 
remarkable, purely non-classical techniques that might clarify some of the 
conceptual problems in matrix convexity theory. On the other hand, it seems 
likely that matrix convexity will provide an appropriate framework for many 
of the calculations in QIT. A striking illustration of this phenomenon can 
be found in [3]. 
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